Sentence Extraction-Based Machine Reading Comprehension for Vietnamese

Abstract

The development of natural language processing (NLP) in general, and machine reading comprehension (MRC) in particular, has attracted great attention from the research community. In recent years, a few large datasets for MRC tasks in Vietnamese have appeared, such as UIT-ViQuAD and UIT-ViNewsQA. However, these datasets are not diverse in their answers to serve the research. In this paper, we introduce UIT-ViWikiQA, the first dataset for evaluating sentence extraction-based machine reading comprehension in the Vietnamese language. UIT-ViWikiQA is converted from the UIT-ViQuAD dataset and comprises 23,074 question-answer pairs based on 5,109 passages from 174 Vietnamese Wikipedia articles. We propose a conversion algorithm to create the dataset for sentence extraction-based machine reading comprehension, and three types of approaches to the task in Vietnamese. Our experiments show that the best model is XLM-R$_{Large}$, which achieves an exact match (EM) of 85.97% and an F1-score of 88.77% on our dataset. In addition, we analyze the experimental results in terms of question type and the effect of context on the performance of the MRC models, thereby showing the challenges posed by the UIT-ViWikiQA dataset.
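
As a rough illustration of the idea described in the abstract, the sketch below maps a span-style answer (given by its character offset) to the sentence that contains it, and scores a predicted sentence with exact match and token-level F1. It is not the authors' conversion algorithm: the function names, the naive punctuation-based sentence splitter, and the whitespace tokenization are all assumptions made for this example, and real Vietnamese text would need a proper sentence and word segmenter.

```python
import re
from collections import Counter


def split_sentences(context):
    """Return (start, end) character spans of sentences.

    Assumption: a sentence ends at '.', '!' or '?' followed by whitespace.
    This is a naive rule used only for illustration.
    """
    spans = []
    start = 0
    for match in re.finditer(r"[.!?]+\s+", context):
        spans.append((start, match.end()))
        start = match.end()
    if start < len(context):
        spans.append((start, len(context)))
    return spans


def span_to_sentence(context, answer_start):
    """Map a span answer's character offset to its enclosing sentence.

    Returns the sentence text and the offset at which it starts.
    """
    for start, end in split_sentences(context):
        if start <= answer_start < end:
            return context[start:end].rstrip(), start
    raise ValueError("answer_start is outside the context")


def exact_match(prediction, reference):
    """1.0 if the two strings match after trimming and lowercasing."""
    return float(prediction.strip().lower() == reference.strip().lower())


def f1_score(prediction, reference):
    """Token-level F1 over whitespace tokens (a simplifying assumption)."""
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


# Hypothetical example passage (English, for readability).
context = (
    "Hanoi is the capital of Vietnam. It lies on the Red River. "
    "The city is over 1000 years old."
)
answer_start = context.index("Red River")
sentence, sentence_start = span_to_sentence(context, answer_start)
```

In this sketch the sentence-level answer keeps a character offset, so a converted example can stay in the same (context, question, answer text, answer start) layout that span-extraction models consume.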

Similar resources

Modelling Reading Times in Bilingual Sentence Comprehension

Relatively little is known about the interaction between a bilingual’s two languages beyond the word level. This paper investigates the issue by comparing word reading times (RTs) in both L1 and L2 to quantitative predictions by statistical language models. Recurrent neural networks are trained on either a Dutch corpus, an English corpus, or the two corpora combined (i.e., the bilingual network...

Stochastic Answer Networks for Machine Reading Comprehension

We propose a simple yet robust stochastic answer network (SAN) that simulates multistep reasoning in machine reading comprehension. Compared to previous work such as ReasoNet, the unique feature is the use of a kind of stochastic prediction dropout on the answer module (final layer) of the neural network during the training. We show that this simple trick improves robustness and achieves result...

A case for the sentence in reading comprehension.

PURPOSE This article addresses sentence comprehension as a requirement of reading comprehension within the framework of the narrow view of reading that was advocated in the prologue to this forum. The focus is on the comprehension requirements of complex sentences, which are characteristic of school texts. METHOD Topics included in this discussion are (a) evidence linking sentence comprehensi...

S-Net: From Answer Extraction to Answer Generation for Machine Reading Comprehension

In this paper, we present a novel approach to machine reading comprehension for the MS-MARCO dataset. Unlike the SQuAD dataset that aims to answer a question with exact text spans in a passage, the MS-MARCO dataset defines the task as answering a question from multiple passages and the words in the answer are not necessary in the passages. We therefore develop an extraction-then-synthesis frame...

Evaluating Machine Reading Systems through Comprehension Tests

This paper describes a methodology for testing and evaluating the performance of Machine Reading systems through Question Answering and Reading Comprehension Tests. The methodology is being used in QA4MRE (QA for Machine Reading Evaluation), one of the labs of CLEF. We report here the conclusions and lessons learned after the first campaign in 2011.

Journal

Journal title: Lecture Notes in Computer Science

Year: 2021

ISSN: 1611-3349, 0302-9743

DOI: https://doi.org/10.1007/978-3-030-82147-0_42